Search CORE

17 research outputs found

From Questions to Effective Answers: On the Utility of Knowledge-Driven Querying Systems for Life Sciences Data

Author: Asiaee Amir H.
Doshi Prashant
Minning Todd
Parikh Priti
Sahoo Satya
Sheth Amit
Tarleton Rick L.
Publication venue
Publication date: 01/10/2012
Field of study

We compare two distinct approaches for querying data in the context of the life sciences. The first approach utilizes conventional databases to store the data and intuitive form-based interfaces to facilitate easy querying of the data. These interfaces could be seen as implementing a set of "pre-canned" queries commonly used by the life science researchers that we study. The second approach is based on semantic Web technologies and is knowledge (model) driven. It utilizes a large OWL ontology and same datasets as before but associated as RDF instances of the ontology concepts. An intuitive interface is provided that allows the formulation of RDF triples-based queries. Both these approaches are being used in parallel by a team of cell biologists in their daily research activities, with the objective of gradually replacing the conventional approach with the knowledge-driven one. This provides us with a valuable opportunity to compare and qualitatively evaluate the two approaches. We describe several benefits of the knowledge-driven approach in comparison to the traditional way of accessing data, and highlight a few limitations as well. We believe that our analysis not only explicitly highlights the specific benefits and limitations of semantic Web technologies in our context but also contributes toward effective ways of translating a question in a researcher's mind into precise computational queries with the intent of obtaining effective answers from the data. While researchers often assume the benefits of semantic Web technologies, we explicitly illustrate these in practice

arXiv.org e-Print Archive

Crossref

Scholar Commons - Institutional Repository of the University of South Carolina

CORE

A Semantic Problem Solving Environment for Integrative Parasite Research: Identification of Intervention Targets for Trypanosoma cruzi

Author: A Bernstein
A Brazma
A Ruttenberg
A Ruttenberg
AA Ackermann
Aaron R. Jex
Amir H. Asiaee
Amit P. Sheth
B Chukualim
BM Good
C Aurrecoechea
C Bizer
C Blaschke
C Goble
C Goble
C Hertz-Fowler
D Xu
E Antezana
E Sirin
H Dietze
H Lam
H Tang
J Bhagat
J Luciano
J Malone
JA Atwood
JC Jeremy
K Christoph
K Eilbeck
K-H Cheung
M Ashburner
M Aslett
M Johnson
M Kanehisa
NM El-Sayed
P Hitzler
P Mendes
PR Smart
Prashant Doshi
Priti P. Parikh
R Brinkman
Rick Tarleton
Sarasi Lalithsena
Satya S. Sahoo
SS Sahoo
SS Sahoo
SS Sahoo
SS Sahoo
T Minning
TA Minning
Todd A. Minning
V Cross
V Petri
Vinh Nguyen
W Hersh
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Effective research in parasite biology requires analyzing experimental lab data in the context of constantly expanding public data resources. Integrating lab data with public resources is particularly difficult for biologists who may not possess significant computational skills to acquire and process heterogeneous data stored at different locations. Therefore, we develop a semantic problem solving environment (SPSE) that allows parasitologists to query their lab data integrated with public resources using ontologies. An ontology specifies a common vocabulary and formal relationships among the terms that describe an organism, and experimental data and processes in this case. SPSE supports capturing and querying provenance information, which is metadata on the experimental processes and data recorded for reproducibility, and includes a visual query-processing tool to formulate complex queries without learning the query language syntax. We demonstrate the significance of SPSE in identifying gene knockout targets for T. cruzi. The overall goal of SPSE is to help researchers discover new or existing knowledge that is implicitly present in the data but not always easily detected. Results demonstrate improved usefulness of SPSE over existing lab systems and approaches, and support for complex query design that is otherwise difficult to achieve without the knowledge of query language syntax

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Scholar Commons - Institutional Repository of the University of South Carolina

CORE

From Questions to Effective Answers: On the Utility of Knowledge-Driven Querying Systems for Life Sciences Data

Author: Asiaee Amir H.
Doshi Prashant
Minning Todd
Parikh Priti
Sahoo Satya S.
Sheth Amit P.
Tarleton Rick L.
Publication venue: CORE Scholar
Publication date: 01/01/2010
Field of study

We compare two distinct approaches for querying data in the context of the life sciences. The first approach utilizes conventional databases to store the data and intuitive form-based interfaces to facilitate easy querying of the data. These interfaces could be seen as implementing a set of \u27pre-canned\u27 queries commonly used by the life science researchers that we study. The second approach is based on semantic Web technologies and is knowledge (model) driven. It utilizes a large OWL ontology and same datasets as before but associated as RDF instances of the ontology concepts. An intuitive interface is provided that allows the formulation of RDF triples-based queries. Both these approaches are being used in parallel by a team of cell biologists in their daily research activities, with the objective of gradually replacing the conventional approach with the knowledge-driven one. This provides us with a valuable opportunity to compare and qualitatively evaluate the two approaches. We describe several benefits of the knowledge-driven approach in comparison to the traditional way of accessing data, and highlight a few limitations as well. We believe that our analysis not only explicitly highlights the specific benefits and limitations of semantic Web technologies in our context but also contributes toward effective ways of translating a question in a researcher\u27s mind into precise computational queries with the intent of obtaining effective answers from the data. While researchers often assume the benefits of semantic Web technologies, we explicitly illustrate these in practice

Scholar Commons - Institutional Repository of the University of South Carolina

CORE

From Questions to Effective Answers: On the Utility of Knowledge-Driven Querying Systems for Life Sciences Data

Author: Asiaee Amir H.
Doshi Prashant
Minning Todd
Parikh Priti
Sahoo Satya S.
Sheth Amit P.
Tarleton Rick L.
Publication venue: SelectedWorks
Publication date: 07/10/2014
Field of study

We compare two distinct approaches for querying data in the context of the life sciences. The first approach utilizes conventional databases to store the data and provides intuitive form-based interfaces to facilitate querying of the data, commonly used by the life science researchers that we study. The second approach utilizes a large OWL ontology and the same datasets associated as RDF instances of the ontology. Both approaches are being used in parallel by a team of cell biologists in their daily research activities, with the objective of gradually replacing the conventional approach with the knowledge-driven one. We describe several benefits of the knowledge-driven approach in comparison to the traditional one, and highlight a few limitations. We believe that our analysis not only explicitly highlights the benefits and limitations of semantic Web technologies in the context of life sciences but also contributes toward effective ways of translating a question in a researcher\u27s mind into precise queries with the intent of obtaining effective answers

CORE

From Questions to Effective Answers: On the Utility of Knowledge-Driven Querying Systems for Life Sciences Data

Author: Asiaee Amir H.
Doshi Prashant
Minning Todd
Parikh Priti
Sahoo Satya S.
Sheth Amit P.
Tarleton Rick L.
Publication venue: SelectedWorks
Publication date: 06/02/2016
Field of study

We compare two distinct approaches for querying data in the context of the life sciences. The first approach utilizes conventional databases to store the data and intuitive form-based interfaces to facilitate easy querying of the data. These interfaces could be seen as implementing a set of \u27pre-canned\u27 queries commonly used by the life science researchers that we study. The second approach is based on semantic Web technologies and is knowledge (model) driven. It utilizes a large OWL ontology and same datasets as before but associated as RDF instances of the ontology concepts. An intuitive interface is provided that allows the formulation of RDF triples-based queries. Both these approaches are being used in parallel by a team of cell biologists in their daily research activities, with the objective of gradually replacing the conventional approach with the knowledge-driven one. This provides us with a valuable opportunity to compare and qualitatively evaluate the two approaches. We describe several benefits of the knowledge-driven approach in comparison to the traditional way of accessing data, and highlight a few limitations as well. We believe that our analysis not only explicitly highlights the specific benefits and limitations of semantic Web technologies in our context but also contributes toward effective ways of translating a question in a researcher\u27s mind into precise computational queries with the intent of obtaining effective answers from the data. While researchers often assume the benefits of semantic Web technologies, we explicitly illustrate these in practice

CORE

A Semantic Problem Solving Environment for Integrative Parasite Research: Identification of Intervention Targets for \u3cem\u3eTrypanosoma cruzi\u3c/em\u3e

Author: Asiaee Amir H.
Doshi Prashant
Lalithsena Sarasi
Minning Todd
Nguyen Vinh
Parikh Priti
Sahoo Satya S.
Sheth Amit P.
Tarleton Rick L.
Publication venue: SelectedWorks
Publication date: 03/10/2014
Field of study

Background: Research on the biology of parasites requires a sophisticated and integrated computational platform to query and analyze large volumes of data, representing both unpublished (internal) and public (external) data sources. Effective analysis of an integrated data resource using knowledge discovery tools would significantly aid biologists in conducting their research, for example, through identifying various intervention targets in parasites, and in deciding the future direction of ongoing as well as planned projects. A key challenge in achieving this objective is the heterogeneity between the internal lab data, usually stored as flat files, Excel spreadsheets or custom-built databases, and the external databases. Reconciling the different forms of heterogeneity and effectively integrating data from disparate sources is a nontrivial task for biologists and requires a dedicated informatics infrastructure. Thus, we developed an integrated environment using Semantic Web technologies that may provide biologists the tools for managing and analyzing their data, without the need for acquiring in-depth computer science knowledge. Methodology/Principle Findings: We developed a semantic problem-solving environment (SPSE) that uses ontologies to integrate internal lab data with external resources in a Parasite Knowledge Base (PKB), which has the ability to query across these resources in a unified manner. The SPSE includes Web Ontology Language (OWL)-based ontologies, experimental data with its provenance information represented using the Resource Description Format (RDF), and a visual querying tool, Cuebee, that features integrated use of Web services. We demonstrate the use and benefit of SPSE using example queries for identifying gene knockout targets of Trypanosoma cruzi for vaccine development. Answers to these queries involve looking up multiple sources of data, linking them together and presenting the results. Conclusion/Significance: The SPSE facilitates parasitologists in leveraging the growing, but disparate, parasite data resources by offering an integrative platform that utilizes Semantic Web techniques, while keeping their workload increase minimal

CORE